Two Stage Approach to Document Retrieval using Genetic Algorithm
نویسنده
چکیده
─ Retrieval of relevant documents from a large document collection is a challenging task. Document Retrieval is concerned with indexing and retrieving documents provided in a document collection. Documents are represented by document descriptors which are defined as terms or keywords extracted from the textual documents. Formulating an optimal query with a set of document descriptors involves searching a huge search space for the better permutation and combination of terms. As Genetic Algorithm is well suited for searching huge search spaces, in this paper, a two stage method is proposed for efficient information retrieval system using genetic algorithm. In our proposed method, Genetic Algorithm generates the best combination terms from a set of the document descriptors. Index Words─ Genetic algorithm, Document retrieval, Term co occurrences,
منابع مشابه
Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملCombination of Genetic Algorithm With Lagrange Multipliers For Lot-Size Determination in Capacity Constrained Multi-Period, Multi-Product and Multi-Stage Problems
Abstract : In this paper a meta-heuristic approach has been presented to solve lot-size determination problems in a complex multi-stage production planning problems with production capacity constraint. This type of problems has multiple products with sequential production processes which are manufactured in different periods to meet customer’s demand. By determining the decision variables, mac...
متن کاملApplying Machine Translation to Two-Stage Cross-Language Information Retrieval
Cross-language information retrieval (CLIR), where queries and documents are in di erent languages, needs a translation of queries and/or documents, so as to standardize both of them into a common representation. For this purpose, the use of machine translation is an e ective approach. However, computational cost is prohibitive in translating large-scale document collections. To resolve this pr...
متن کاملWeighting in Information Retrieval Using Genetic Programming: A Three Stage Process
This paper presents term-weighting schemes that have been evolved using genetic programming in an adhoc Information Retrieval model. We create an entire term-weighting scheme by firstly assuming that term-weighting schemes contain a global part, a term-frequency influence part and a normalisation part. By separating the problem into three distinct phases we reduce the search space and ease the ...
متن کاملA multi-objective genetic algorithm (MOGA) for hybrid flow shop scheduling problem with assembly operation
Scheduling for a two-stage production system is one of the most common problems in production management. In this production system, a number of products are produced and each product is assembled from a set of parts. The parts are produced in the first stage that is a fabrication stage and then they are assembled in the second stage that usually is an assembly stage. In this article, the first...
متن کامل